Reinforcement Learning in Perfect-Information Games∗

نویسنده

  • Maxwell Pak
چکیده

This paper studies action-based reinforcement learning in finite perfectioninformation games. Restrictions on the valuation updating rule that guarantee that the play eventually converges to a subgame-perfect Nash equilibrium (SPNE) are identified. These conditions are mild enough to contain interesting and plausible learning behavior. We provide two examples of such updating rule that suggest that the extent of knowledge and rationality assumptions needed to support a SPNE outcome in finite perfect-information games may be minimal.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generalized reinforcement learning in perfect-information games

This paper studies action-based reinforcement learning in finite perfectinformation games. Restrictions on the valuation updating rule that that are necessary and sufficient for the play to converge to a subgame-perfect Nash equilibrium (SPNE) are identified. These conditions encompass well-known examples of reinforcement learning and are mild enough to contain other interesting and plausible l...

متن کامل

A reinforcement learning process in extensive form games

The CPR (“cumulative proportional reinforcement”) learning rule stipulates that an agent chooses a move with a probability proportional to the cumulative payoff she obtained in the past with that move. Previously considered for strategies in normal form games (Laslier, Topol and Walliser, Games and Econ. Behav., 2001), the CPR rule is here adapted for actions in perfect information extensive fo...

متن کامل

Policy Learning in Imperfect-information Infinite Dynamic Games

Dynamic games (DGs) play an important role in distributed decision making and control in complex environments. Finding optimal/approximate solutions for these games in the imperfect-information setting is currently a challenge for mathematicians and computer scientists, especially when state and action spaces are infinite. This paper presents an approach to this problem by using multi-agent rei...

متن کامل

Confusion and Reinforcement Learning in Experimental Public Goods Games ∗

We use a limited information environment to mimic the state of confusion in an experimental, repeated public goods game. The results show that reinforcement learning leads to dynamics similar to those observed in standard public goods games. However, closer inspection shows that individual decay of contributions in standard public goods games cannot be fully explained by reinforcement learning....

متن کامل

Hybrid learning in signalling games

Lewis-Skyrms signaling games (Lewis 1969; Skyrms 2010) have been studied under a variety of low-rationality learning dynamics (Barrett 2006; Barrett and Zollman 2009; Huttegger, Skyrms, Smead, and Zollman 2010; Huttegger, Skyrms, Tarrès, and Wagner 2014; Huttegger, Skyrms, and Zollman 2014). Reinforcement dynamics are stable but slow and prone to evolving suboptimal signaling conventions. A low...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006